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The invention relates to polynucleotides coding for a fuco- 
syl transferase. Furthermore, the invention relates to partial 
sequences of these polynucleotides as well as to vectors compris- 
ing these polynucleotides, recombinant host cells, plants and in- 
seicts transfected with the polynucleotides or with DNA derived 
therefrom, respectively, as well as to glycoproteins produced in 
these systems. 

Glycoproteins exhibit a variety and complexity of carbo-hy- 
drate units, the composition and arrangement of the carbohydrates 
being characteristic of different organisms. The oligosaccharide 
units of the glycoproteins have a number of tasks, e.g. they are 
important in regulating metabolism, they are involved in trans- 
mitting cell-cell interactions, they determine the circulation 
periods of proteins in circulation, and they are decisive for 
recognizing epitopes in antigen-antibody reactions. 

The glycosylation of glycoproteins starts in the endo-plas- 
matic reticulum (ER) , where the oligosaccharides are either bound 
to asparagine side chains by N-glycosidic bonds or to serine or 
threonine side chains by O-glycosidic bonds. The N-bound oligo- 
saccharides contain a common core from a penta-saccharide unit 
which consists of three mannose and two N-acetyl glucose amine 
residues. To further modify the carbohydrate units, the proteins 
are transported from the ER to the Golgi complex. The structure 
of the N-bound oligosaccharide units of glycoproteins is deter- 
mined by their conformation and by the composition of the glyco- 
syl transferases of the Golgi compartments in which they are 
processed . 

It has been shown that the core pentasaccharide unit in the 
Golgi complex of some plant and insect cells is substituted by 
xylose and otl,3-bound fucose (P. Lerouge et al . , 1998, Plant Mol . 
Biol.. 38, 31-48; Rayon et al . , 1998, L. Exp. Bot. 49, 1463-1472). 
The heptasaccharide "MMXF 3 " forming constitutes the main oligo- 
saccharide type in plants (Kurosaka et al . , 1991, J. Biol. Chem. , 
266, 4168-4172). Thus, e.g., the horseradish peroxidase, carrot 
(3-f ructosidase and Erythrina cristagalli comprise lectin as well 
as the honeybee venom phospholipase A2 or the neuronal membrane n 
glycoproteins from insect embryos al, 3-fucose residues which are 
bound to* the glycan core. These structures are also termed com- 
plex N-glycans or mannose-def icient or truncated N-glycans, re- 1 
spectively. The a-mannosyl residues may be further replaced by 
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GlcNAc, to which galactose and fucose are bound so that a struc- 
ture is prepared which corresponds to the human Lewis a-epi- 
tope (Melo et al., 1997, FEBS Lett 415, 186-191; Fi tchette-Laine 
et al., 1997, Plant J. 12, 1411-1417). 

Neither xylose nor the al,3-bound fucose exist in mammalian 
glycoproteins . It has been found that the core-al , 3-f ucose plays 
an important role in the epitope recognition of antibodies which 
are directed against plant and insect N-bound oligosaccharides 
(I.B.H. Wilson et al., Glycobiology Vol . 8, No. 7, pp. 651-661, 
1998) , and thereby trigger immune reactions in human or animal 
bodies against these oligosaccharides . The al , 3-f ucose residue 
furthermore seems to be one of the main causes for the wide- 
spread allergic cross reactivity between various plant and insect 
allergens (Tretter et al . , Int. Arch. Allergy Immunol. 1993; 
102:259-266) and is also termed "cross-reactive carbohydrate de- 
terminant" (CCD) . In a study of epitopes of tomatoes and grass 
pollen, also al,3-bound fucose residues were found as a common 
determinant, which seems to be the reason why tomato and grass 
pollen allergies frequently occur together in patients (Petersen 
et al., 1996, J. Allergy Clin. Immunol., Vol. 98, 4; 805-814). 
Due to the frequent occurrence of immunological cross reactions, 
the CCDs moreover mask allergy diagnoses. 

The immunological reactions, triggered in the human body by 
plant proteins are the main problem in the medicinal use of re- 
combinant human proteins produced in plants. To circumvent this 
problem, al , 3-core-f ucosylation would have to be prevented. In a 
study it could be demonstrated that oligosaccharides comprising 
an L-galactose instead of an L-f ucose ( 6-deoxy-L-galactose) nev- 
ertheless are biologically fully active (E. Zablackis et al . , 
1996, Science, Vol. 272). According to another study, a mutant of 
the plant Arabidopsis thaliana was isolated in which the N-ace- 
tyl-glucosaminyl transferase I, the first enzyme in the biosyn- 
thesis of complex glycans, is missing. The biosynthesis of the 
complex glycoproteins in this mutant thus is disturbed. Neverthe- 
less, these mutant plants are capable of developing normally un- 
der certain conditions (A. Schaewen et al , 1993, Plant Physiol. 
102; 1109-1118) . 

To purposefully block the binding of the core-al , 3-f ucose in 
an oligosaccharide without also interfering in other glycosyla- 
tion steps, merely that enzyme would have to be inactivated which 
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is directly responsible for this specific glycosylation, i.e. the 
core-al, 3-f ucosyl transferase. It has been isolated and charac- 
terized for the first time from mung beans, and it has been found 
that the activity of this enzyme depends on the presence of non- 
reducing GlcNAc ends (Staudacher et al . , 1995, Glycoconjugate J . 
12, .780-786). This transferase which only occurs in plants and 
insect, yet not in human beings or in other vertebrates, would 
have to be inactivated on purpose or suppressed so that human 
proteins which are produced in plants or in plant cells or also 
in insects or in insect cells, respectively, do no longer com- 
prise this immune-reaction-triggering epitope," as has been the - 
case so far. 

The publication by John M. Burke "Clearing the way for ribo- 
zymes" (Nature Biotechnology 15:414-415; 1997) relates to the 
general mode of function of ribozymes. 

The publication by Pooga et al., "Cell penetrating PNA con- 
structs regulate galanin receptor levels and modify pain trans- 
mission in vivo" (Nature Biotechnology 16:857-861; 1998) relates 
to PNA molecules in general and specifically to a PNA molecule 
that is complementary to human galanin receptor type 1 mRNA. 

US 5,272,066 A relates to a method of changing eukaryotic 
and prokaryotic proteins to prolongue their circulation in vivo. 
In this instance, the bound oligosaccharides are changed with the 
help of various enzymes, among them also GlcNAc-0tl^3 ( 4 ) -f ucosyl 
transferase. 

EP 0 643 132 Al relates to the cloning of an al , 3-f ucosyl 
transferase isolated from human cells (THP-1) . The carbohydrate 
chains described in this publication correspond to human sialyl 
Lewis x- and sialyl Lewis a-oligosaccharides . The specificity of 
the enzyme from human cells is quite different than that of fuco- 
syltransf erase from plant cells. 

It is an object of the present invention to clone and to se- 
quence the gene which codes for a plant f ucosyl transferase, and 
to prepare vectors comprising this gene, DNA fragments thereof 
or an altered DNA or a DNA derived therefrom, to transfect plants 
and insects as well as cells thereof with one of these vectors, 
to produce glycoproteins that do not comprise the normally occur- 
ring al , 3 -core-f ucose , as well as to provide corresponding meth- 
ods therefor. 

The object according to the invention is achieved by a DNA 
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molecule comprising a sequence according to SEQ ID NO: 1 (in this 
disclosure also the IUPAC code has been used, "N" meaning inosin) 
with an open reading frame from base pair 211 to base pair 1740 
or being at least 50% homologous to the above sequence or hybrid- 
izing with the above-indicated sequence under stringent condi- 
tions, or comprising a sequence which has degenerated to the 
above DNA sequence due to the genetic code, the sequence coding 
for a plant protein which has fucosyl transferase activity or is 
complementary thereto. 

This sequence which has not been described before can be 
perfectly used for any experiments, analysis and methods for pro- 
duction etc. which relate to the plant fucosyl transferase activ- 
ity. Here the DNA sequence as well as the protein coded by this 
sequence are of interest. However, in particular the DNA sequence 
will be used for the inhibition of the fucosyl transferase activ- 
ity. 

The open reading frame of the SEQ ID NO: 1 codes for a pro- 
tein with 510 amino acids and with a theoretical molecular weight 
of 56.8 kDa, a transmembrane portion presumably being present in 
the region between Asn3 6 and Gly54. The calculated pi value of 
the encoded protein of the sequence according to SEQ ID NO: 1 is 
7 . 51 . 

The activity of the plant fucosyl transferase is detected by 
a method and measured, the fucosyl transferase being added to a 
sample comprising labelled fucose and an acceptor (e.g. a glyco- 
protein) bound to a carrier, e.g. Sepharose. After the reaction 
time, the sample is washed, and the content of bound fucose is 
measured. The activity of the fucosyl transferase in this case is 
seen as positive if the activity measurement is higher by at 
least 10 to 20%, in particular at least 30 to 50%, than the ac- 
tivity measurement of the negative control. The structure of the 
glycoprotein may additionally be verified by means of HPLC . Such 
protocols are prior art (Staudacher et al . 1998, Anal. Biochem. 
246, 96-101; Staudacher et al . 1991, Eur. J. Biochem. 199, 745- 
751) . 

For example, fucosyl transferase is admixed to a sample com- 
prising radioactively labelled fucose and an acceptor, e.g. 
GlcNAcpl-2Manotl-3 (GlcNApl-2Manal-6 ) Man(3l-4GlcNAcPl-4GlcNAc(3l-Asn . 
After the reaction time, the sample is purified by anion exchange 
chromatography, and the content of bound fucose is measured. 
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From the difference of the measured radioactivity of the sample 
with acceptor and that of a negative control without acceptor, 
the activity can be calculated. The activity of the fucosyl 
transferase is already evaluated as positive if the radioactivity 
measured is at least 30-40% higher than the measured radioactiv- 
ity of the negative sample. 

The pairing of two DNA molecules can be changed by selection 
of the temperature and ionic strength of the sample. By stringent 
conditions, according to the invention conditions are understood 
which allow for an exact, stringent, binding. For instance, the 
DNA molecules are hybridized in 7% sodium dodecyl sulfate (SDS) , 
0 . 5M NaP04, pH 7.0, ImM EDTA at 50 o C, and washed with 1% SDS at 
42°C. 

Whether sequences have an at least 50% homology to SEQ ID 
NO: 1 can be determined e.g. by means of the program FastDB of 
EMBL or SWISSPROT data bank. 

Preferably, the sequence of the DNA molecule of the inven- 
tion encodes a protein with a GlcNAc-ocl , 3-f ucosyl transferase ac- 
tivity, in particular with a core-ocl , 3-f ucosyl transferase 
activity. 

As described above the core of al,3-fucosyl transferase is 
present in plants and insects, however, not in the human body, so 
that in particular this DNA sequence is useful in analysis and 
experiments as well as methods for production which are fucosyl 
transferase specific . 

By a core-ocl , 3-f ucosyl transferase, in particular GDP-L- 
Fuc : Asn-bound GlcNAc-ocl , 3-f ucosyl transferase is understood. 
Within the scope of the present invention, the term al,3-fucosyl 
transferase as a rule particularly means core-ocl,3 fucosyl trans- 
ferase. For the above-described activity measurement, in particu- 
lar acceptors having a non-reducing GlcNAc terminus are used. 
Such acceptors are, e.g., GlcNAcpl-2Manal-3 (GlcNAcpl-2Manal- 
6)ManPl-4GlcNAcpl-4GlcNAcpl-Asn, GlcNAcpl-2Manal-3 (GlcNAcpl- 
2Manal-6)Manpl-4GlcNAcpl-4 ( Fucal-6 ) GlcNAcpl-Asn and GlcNAcPl- 
2Manal-3 [Manal-3 (Manal-6 ) Manal-6 ] Manpl-4GlcNAcpl-4GlcNAcPl-Asn . 
Whether the fucose is bound or not can furthermore be determined 
by measuring the insensitivi ty relative to N-glycosidase F, which 
can be detected by means of mass spectrometry. 

Preferably, the DNA molecule according to the invention com- 
prises at least 70-80%, particularly preferred at least 95%, ho- 



- 6 - 



mology to the sequence according to SEQ ID NO: 1. This sequence 
codes for a particularly active GlcNAc-al , 3-f ucosyl transferase. 

Since the DNA sequence can be more or less changed according 
to the plant or the insect a sequence which shows, for example, 
70 % homology to a sequence according to SEQ ID No 1 has also a 
fucosyl transferase activity which is sufficient in order to be 
used in analysis, experiments or methods of production as above 
described. 

According to a further advantageous embodiment , the DNA 
molecule _ comprises 2150 to 2250,_ in particular 2198, base pairs. 
This DNA molecule comprises 100 to 300, preferably 210, base 
pairs upstream in front of the start codon, as well as 350 to 
440, in particular 458, base pairs downstream after the stop co- 
don of the open reading frame, wherein the end of the DNA mole- 
cule preferably comprises a 3 ' -poly (A) -tail . In this manner, a 
faultless regulation on translation level is ensured and a DNA 
molecule is provided which is particularly efficient and unprob- 
lematic for the coding of an active GlcNAc-al , 3-f ucosyl transfe- 
rase. 

The present invention moreover relates to a DNA molecule 
which comprises a sequence according to SEQ ID NO: 3 or compris- 
ing a sequence having at least 85%, particularly preferred at 
least 95%, in particular at least 99%, homology to the above- 
identified sequence or which, under stringent conditions, hybrid- 
izes with the above-indicated sequence or which has degenerated 
to the above-indicated DNA sequence due to the genetic code. The 
homology preferably is determined with a program which recognizes 
insertions and deletions and which does not consider these in the 
homology calculation. This nucleotide sequence codes for a con- 
served peptide motif, which means that the plurality of the ac- 
tive and functioning GlcNAc-al , 3-f ucosyl transferases comprises 
the amino acid sequence encoded thereby. In this instance, the 
sequence may either have the same size as the sequence according 
to SEQ ID NO: 3, or, of course, it may also be larger. This se- 
quence has a smaller length than the sequence which codes the 
complete protein and is therefore less sensitive with respect to 
recombination, deletion, or any other mutations. Due to the con- 
servative motif and its higher stability this sequence is par- 
ticularly advantageous for sequence recognising test . 

SEQ ID NO: 3 comprises the following sequence: 



5 ' -GAAGCCCTGAAGCACTACAAATTTAGCTTAGCGTTTGAAAATTCGAATGAGGAAG 
ATTATGTAACTGAAAAATTCTTCCAATCCCTTGTTGCTGGAACTGTCCCT-3 ' 

In a further aspect, the present invention relates to a DNA 
molecule which comprises a partial sequence of one of the above- 
indicated DNA molecules and has a size of from 20 to 200, pref- 
erably from 3 0 to 50, base pairs. The DNA molecule may, e.g., be 
utilized to bind, as a probe, to complementary sequences of 
GlcNAc-ocl , 3-fucosyl transferases so that they can be selected 
from a sample. In this manner, further GlcNAc-al , 3-fucosyl trans- 
ferases from the most varying plants and insects can be selected, 
isolated and characterized. Any desired one or also several dif- 
ferent partial sequences may be used, in particular a part of the 
conserved motif already described above. 

In doing so, it is particularly advantageous if one of the 
above-indicated DNA molecules is covalently associated with a de- 
tectable labelling substance. As the labelling substance, any 
common marker can be used, such as, e.g., fluorescent, lumines- 
cent, radioactive markers, non-iso topic markers, such as bio tin, 
etc. In this manner, reagents are provided which are suitable for 
the detection, selection and quantitation of corresponding DNA 
molecules in solid tissue samples (e.g. from plants) or also in 
liquid samples, by means of hybridizing methods. 

A further aspect of the invention relates to a biologically 
functional vector which comprises one of the above-indicated DNA 
molecules or parts thereof of differing lengths with at least 20 
base pairs. For transfection into host cells, an independent 
vector capable of amplification is necessary, wherein, depending 
on the host cell, transfection mechanism, task and size of the 
DNA molecule, a suitable vector can be used. Since a large number 
of different vectors is known, an enumeration thereof would go 
beyond the limits of the present application and therefore is 
done without here, particularly since the vectors are very well 
known to the skilled artisan (as regards the vectors as well as 
all the techniques and terms used in this specification which are 
known to the skilled artisan, cf. also Sambrook Maniatis). Ide- 
ally, the vector has a small molecule mass and should comprise 
selectable genes so as to lead to an easily recognizable pheno- 
type in a cell so thus enable an easy selection of vector-con- 
taining and vector-free host cells. To obtain a high yield of DNA 
and corresponding gene products, the vector should comprise a 
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strong promoter, as well as an enhancer, gene amplification sig- 
nals and regulator sequences. For an autonomous replication of 
the vector, furthermore, a replication origin is important. Poly- 
adenylation sites are responsible for correct processing of the 
mRNA and splice signals for the RNA transcripts. If phages, vi- 
ruses or virus particles are used as the vectors, packaging sig- 
nals will control the packaging of the vector DNA. For instance, 
for transcription in plants, Ti plasmids are suitable, and for 
transcription in insect cells, baculoviruses , and in insects, re- 
spectively, transposons, such as the P element. 

If the above-described inventive vector is inserted into a 
plant or into a plant cell, a post-transcriptional suppression of 
the gene expression of the endogenous ocl,3-fucosyl transferase 
gene is attained by transcription of a transgene homologous 
thereto or of parts thereof, in sense orientation. For this sense 
technique, furthermore, reference is made to the publications by 
Baucombe 1996, Plant. Mol. Biol., 9:373-382, and Brigneti et al . , 
1998, EMBO J. 17:6739-6746. This strategy of "gene silencing" is 
an effective way of suppressing the expression of the al,3-fuco- 
syl transferase gene, cf. also Waterhouse et al . , 1998, Proc . 
Natl. Acad. Sci . USA, 95:13959-13964. 

Furthermore, the invention relates to a biologically func- 
tional vector comprising a DNA molecule according to one of the 
above-described embodiments, or parts thereof of differing 
lengths in reverse orientation to the promoter. If this vector is 
transfected in a host cell, an " antisense mRNA " will be read 
which is complementary to the mRNA of the GlcNAc-al , 3-f ucosyl 
transferase and complexes the latter. This bond will either hin- 
der correct processing, transportation, stability or, by prevent- 
ing ribosome annealing, it will hinder translation and thus the 
normal gene expression of the GlcNAc-al , 3-f ucosyl transferase. 

Although the entire sequence of the DNA molecule could be 
inserted into the vector, partial sequences thereof because of 
their smaller size may be advantageous for certain purposes. With 
the antisense aspect, e.g., it is important that the DNA molecule 
is large enough to form a sufficiently large antisense mRNA which 
will bind to the transferase mRNA. A suitable antisense RNA mole- 
cule comprises, e.g., from 50 to 200 nucleotides since many of 
the known, naturally occurring antisense RNA molecules comprise 
approximately 100 nucleotides. 




• 
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For a particularly effective inhibition of the expression of 
an active al,3-fucosyl transferase, a combination of the sense 
technique and the antisense technique is suitable (Waterhouse et 
al., 1998, Proc. Natl. Acad. Sci . , USA, 95:13959-13964). 

Advantageously, rapidly hybridizing RNA molecules are used. 
The efficiency of antisense RNA molecules which have a size of 
more than 50 nucleotides will depend on the annealing kinetics in 
vitro. Thus, e.g., rapidly annealing antisense RNA molecules ex- 
hibit a greater inhibition of protein expression than slowly hy- 
bridizing RNA molecules (Wagner et al . , 1994, Annu. Rev. 
Microbiol., 48:713-742; Rittner et al . , 1993, Nucl. Acids Res., 
21:1381-1387). Such rapidly hybridizing antisense RNA molecules 
particularly comprise a large number of external bases (free ends 
and connecting sequences) , a large number of structural subdo- 
mains (components) as well as a low degree of loops (Patzel et 
al . 1998; Nature Biotechnology, 16; 64-68) . The hypothetical sec- 
ondary structures of the antisense RNA molecule may, e.g., be de- 
termined by aid of a computer program, according to which a 
suitable antisense RNA DNA sequence is chosen. 

Different sequence regions of the DNA molecule may be in- 
serted into the vector. One possibility consists, e.g., in in- 
serting into the vector only that part which is responsible for 
ribosome annealing. Blocking in this region of the mRNA will suf- 
fice to stop the entire translation. A particularly high effi- 
ciency of the antisense molecules also results for the 5'- and 
3 ' -nontranslated regions of the gene. 

Preferably, the DNA molecule according to the invention in- 
cludes a sequence which comprises a deletion, insertion and/or 
substitution mutation. The number of mutant nucleotides is vari- 
able and varies from a single one to several deleted, inserted or 
substituted nucleotides. It is also possible that the reading 
frame is shifted by the mutation. In such a "knock-out gene" it 
is merely important that the expression of a GlcNAc-al , 3-f ucosyl 
transferase is disturbed, and the formation of an active, func- 
tional enzyme is prevented. In doing so, the site of the mutation 
is variable, as long as expression of an enzymatically active 
protein is prevented. Preferably, the mutation in the catalytic 
region of the enzyme which is located in the C-terminal region. 
The method of inserting mutations in DNA sequences are well known 
to the skilled artisan, and therefore the various possibilities 
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of mutageneses need not be discussed here in detail. Coincidental 
mutageneses as well as, in particular, directed mutageneses, e.g. 
the site-directed mutagenesis, oligonucleotide-controlled muta- 
genesis or mutageneses by aid of restriction enzymes may be em- 
ployed in this instance. 

The invention further provides a DNA molecule which codes 
for a ribozyme which comprises two sequence portions of at least 
10 to 15 base pairs each, which are complementary to sequence 
portions of an inventive DNA molecule as described above so that 
the ribozyme complexes and cleaves the mRNA which is transcribed 
from a natural GlcNAc-ocl , 3-f ucosyl transferase DNA molecule. The 
ribozyme will recognized the mRNA of the GlcNAc-al , 3 -f ucosyl 
transferase by complementary base pairing with the mRNA. Subse- 
quently, the ribozyme will cleave and destroy the RNA in a se- 
quence-specific manner, before the enzyme is translated. After 
dissociation from the cleaved substrate, the ribozyme will re- 
peatedly hybridize with RNA molecules and act as specific endonu- 
clease. In general, ribozymes may specifically be produced for 
inactivation of a certain mRNA, even if not the entire DNA se- 
quence which codes for the protein is known. Ribozymes are par- 
ticularly efficient if the ribosomes move slowly along the mRNA. 
In that case it is easier for the ribozyme to find a ribosome- 
free site on the mRNA. For this reason, slow ribosome mutants are 
also suitable as a system for ribozymes (J. Burke, 1997, Nature 
Biotechnology; 15, 414-415). This DNA molecule is particularly 
advantageous for the downregulation and inhibition, respectively, 
of the expression of plant GlcNAc-al , 3-f ucosyl transferases. 

One possible way is also to use a varied form of a ribozmye, 
i.e. a minizyme. Minizymes are efficient particularly for cleav- 
ing larger mRNA molecules. A minizyme is a hammer head ribozyme 
which has a short oligonucleotide linker instead of the 
stem/loop II. Dimer-minizymes are particularly efficient 
(Kuwabara et al . , 1998, Nature Biotechnology, 16; 961-965). 
Consequently, the invention also relates to a biologically func- 
tional vector which comprises one of the two last-mentioned DNA 
molecules (mutation or ribozyme-DNA molecule) . What has been said 
above regarding vectors also applies in this instance. Such a 
vector can be, for example, inserted into a microorganism and can 
be used for the production of high concentrations of the above 
described DNA molecules . Furthermore such a vector is particu- 
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larly good for the insertion of a specific DNA molecule into a 
plant or an insect organism in order to downregulate or com- 
pletely inhibit the GlcNAc-al , 3-f ucosyl transferase production in 
this organism . 

According to the invention, there is provided a method of 
preparing a cDNA comprising the DNA molecule of the invention, 
wherein RNA is isolted from an insect or plant cell, in particu- 
lar from hypokotyl cells, by means of which a reverse transcrip- 
tion is carried out after having admixed a reverse transcriptase 
and primers. The individual steps of this method are carried out 
according to protocols known per se. For the reverse transcrip- 
tion, on the one hand, it is possible to produce the cDNA of the 
entire mRNA with the help of oligo(dT) primers, and only then to 
carry out a PCR by means of selected primers so as to prepare 
DNA molecules comprising the GlcNAc-al , 3-f ucosyl transferase 
gene. On the other hand, the selected primers may directly be 
used for the reverse transcription so as to obtain short, spe- 
cific cDNA. The suitable primers may be prepared e.g. syntheti- 
cally according to the pattern of cDNA sequences of the 
transferase. With the help of this method big quantities of the 
inventive cDNA molecules can be produced quickly in a simple way 
and with few mistakes. 

The invention furthermore relates to a method of cloning a 
GlcNAc-al , 3-f ucosyl transferase, characterized in that the DNA 
molecule of the invention is cloned into a vector which subse- 
quently is transfected into a host cell or host, respectively, 
wherein, by selection and amplification of transfected host 
cells, cell lines are obtained which express the active GlcNac- 
al, 3-f ucosyl transferase. The DNA molecule is inserted into the 
vector by aid of restriction endonucleases , e.g.. For the vector, 
there applies what has already been said above. What is important 
in this method is that an efficient host-vector system is chosen. 
To obtain an active enzyme, eukaryotic host cells are particu- 
larly suitable. One possible way is to transfect the vector in 
insect cells. In doing so, in particular an insect virus would 
have to be used as vector, such as, e.g., baculovirus. 

Of course, human or other vertebrate cells can also be 
transfected, in which case the latter would express an enzyme 
foreign to them. 

Preferably, a method of preparing recombinant host cells, in 
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particular plant or insect cells, or plants or insects, respec- 
tively, with a suppressed or completely stopped GlcNac-al , 3-f uco- 
syl transferase production is provided, which is characterized in 
that at least one of the vectors according to the invention, i.e. 
that one comprising the inventive DNA molecule, the mutant DNA 
molecule or the DNA molecule coding for ribozymes or the one com- 
prising the DNA molecule in inverse orientation to the promoter, 
is inserted into the host cell or plant or into the insect. What 
has been said above for the transfection also is applicable in 
this case. _ . _ . _ 

As the host cells, plant cells may, e.g., be used, wherein, 
e.g., the Ti plasmid with the agrobacterium system is eligible. 
With the agrobacterium system it is possible to transfect a plant 
directly: agrobacteria cause root stem galls inplants . If agro- 
bacteria infect an injured plant, the bacteria themselves do not 
get into the plant, but they insert the recombinant DNA portion, 
the so-called T-DNA, from the annular, extra chromosomal, tumour- 
inducing Ti-plasmid into the plant cells. The T-DNA, and thus 
also the DNA molecule inserted therein, are installed in the 
chromosomal DNA of the cell in a stable manner so that the genes 
of the T-DNA will be expressed in the plant. 

There exist numerous known, efficient transfection mecha- 
nisms for different host systems. Some examples are electropora- 
tion, the calcium phosphate method, microinjection, liposome 
method. 

Subsequently, the transfected cells are selected, e.g. on 
the basis of antibiotic resistences for which the vector com- 
prises genes, or other marker genes. Then the transfected cell 
lines are amplified, either in small amounts, e.g. in Petri 
dishes, or in large amounts, e.g. in f ermentors . Furthermore, 
plants have a particular characteristic, i.e. they are capable to 
re-develop from one (transfected) cell or from a protoplast, re- 
spectively, to a complete plant which can be grown. 

Depending on the vector used, processes will occur in the 
host so that the enzyme expression will be suppressed or com- 
pletely blocked: 

If the vector comprising the DNA molecule with the deletion, 
insertion or substitution mutation is transfected, a homologous 
recombination will occur: the mutant DNA molecule will recognize 
the identical sequence in the genome of the host cell despite its 
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mutation and will be inserted exactly on that place so that a 
"knock-out gene" is formed. In this manner, a mutation is intro- 
duced into the gene for the GlcNAc-al , 3-f ucosyl transferase which 
is capable of inhibiting the faultless expression of the GlcNAc- 
al, 3-fucosyl transferase. As has been explained above, with this 
technique it is important that the mutation suffices to block the 
expression of the active protein. After selection and amplifica- 
tion, the gene may be sequenced as an additional check so as to 
determine the success of the homologous recombination or the de- 
- gree of_ mutation, respectively. 

If the vector comprising the DNA molecule coding for a ribo- 
zyme is transfected, the active ribozyme will be expressed in the 
host cell. The ribozyme complexes the complementary mRNA se- 
quence of the GlcNAc-al , 3-fucosyl transferase at least at a cer- 
tain site, cleaves this site, and in this manner it can inhibit 
the translation of the enzyme. In this host cell as well as in 
cell lines, or optionally, plant, respectively, derived there- 
from, GlcNAc-al , 3-f ucosyl transferase will not be expressed. 
In case the vector comprises the inventive DNA molecule in sense 
or inverse direction to the promoter, a sense or antisense-mRNA 
will be expressed in the transfected cell (or plant, respec- 
tively) . The antisense mRNA is complementary at least to a part 
of the mRNA sequence of the GlcNAc-al , 3-f ucosyl transferase and 
may likewise inhibit translation of the enzyme. As an example of 
a method of suppressing the expression of a gene by antisense 
technique, reference is made to the publication by Smith et al . , 
1990, Mol . Gen. Genet. 224:477-481, wherein in this publication 
the expression of a gene involved in the maturing process of to- 
matoes is inhibited. 

In all the systems, expression of the GlcNAc-al , 3-f ucosyl 
transferase is at least suppressed, preferably even completely 
blocked. The degree of the disturbance of the gene expression 
will depend on the degree of complexing, homologous recombina- 
tion, on possible subsequent coincidental mutations and on other 
processes in the region of the genome. The transfected cells are 
checked for GlcNac-otl , 3 -f ucosyl transferase activity and se- 
lected . 

Moreover, it is possible to still further increase the 
above-described suppression of the expression of the al, 3-f ucosyl 
transferase by introducing into the host a vector comprising a 
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gene coding for a mammalian protein, e.g. pi , 4-galactosyl trans- 
ferase, in addition to the insertion of an above-described vec- 
tor. Fucosylation may be reduced by the action of other mammalian 
enzymes, the combination of the inhibition of the expression of 
an active al,3-fucosyl transferase by means of the inventive vec- 
tor and by means of a mammalian enzyme vector being particularly 
efficient . 

Any type of plant may be used for transf ection, e.g. mung 
bean, tobacco plant, tomato and/or potato plant. 

Another advantageous method of producing recombinant host cells, 
in particular plant or insect cells, or plants or insects, re- 
spectively, consists in that the DNA molecule comprising the mu- 
tation is inserted into the genome of the host cell, or plant or 
insect, respectively, in the place of the non-mutant homologous 
sequence (Schaefer et al . , 1997, Plant J.; 11 ( 6 ): 1195-1206 ) . This 
method thus does not function with a vector, but with a pure DNA 
molecule. The DNA molecule is inserted into the host e.g. by gene 
bombardment, microinjection or electroporation, to mention just 
three examples. As has already been explained, the DNA molecule 
binds to the homologous sequence in the genome of the host so 
that a homologous recombination and thus reception of the dele- 
tion, insertion or substitution mutation, respectively, will re- 
sult in the genome : Expression of the GlcNAc-al , 3-f ucosyl 
transferase can be suppressed or completely blocked, respec- 
tively . 

A further aspect of the invention relates to plants or plant 
cells, respectively, as well as insect or insect cells, respec- 
tively, their GlcNAc-al , 3-f ucosyl transferase activity being less 
than 50%, in particular less than 20%, particularly preferred 0%, 
of the GlcNAc-al , 3-f ucosyl transferase activity occurring in 
natural plants or plant cells, respectively, and insects or in- 
sect cells, respectively. The advantage of these plants or plant 
cells, respectively, is that the glycoproteins produced by them 
do not comprise any or hardly comprise any al,3-bound fucose. If 
products of these plants or insects , respectively , are taken up 
by human or vertebrate bodies, there will be no immune reaction 
to the al, 3-fucose epitope. 

Preferably, recombinant plants or plant cells, respectively, 
are provided which have been prepared by one of the methods de- 
scribed above, their GlcNAc-al , 3-f ucosyl transferase production 
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being suppresed or completely blocked, respectively. 

The invention also relates to recombinant insects or insect 
cells, respectively, which have been prepared by one of the meth- 
ods described above and whose GlcNAc-al , 3-f ucosyl transferase 
production is suppressed or completely blocked, respectively. 
Also in this instance, no glycoproteins having al, 3 -bound fucose 
residues are produced so that likewise no immune reaction to the 
al, 3-fucose epitope will occur. 

The invention also relates to a PNA molecule comprising a 
base sequence complementary to the sequence of the DNA mole- 
cule according to the invention as well as partial sequences 
thereof. PNA (peptide nucleic acid) is a DNA-like sequence, the 
nucleobases being bound to a pseudo-peptide backbone. PNA gener- 
ally hybridizes with complementary DNA- , RNA- or PNA-oligomers by 
Watson-Crick base pairing and helix formation. The peptide back- 
bone ensures a greater resistance to enzymatic degradation. The 
PNA molecule thus is an improved antisense agent. Neither nucle- 
ases nor proteases are capable of attacking a PNA molecule. The 
stability of the PNA molecule, if bound to a complementary se- 
quence, comprises a sufficient steric blocking of DNA and RNA po- 
lymerases, reverse transcriptase, telomerase and ribosomes . 
If the PNA molecule comprises the above-mentioned sequence, it 
will bind to the DNA or to a site of the DNA, respectively, which 
codes for GlcNAc-cxl , 3-f ucosyl transferase and in this way is ca- 
pable of inhibiting transcription of this enzyme. As it is nei- 
ther transcribed nor translated, the PNA molecule will be 
prepared synthetically, e.g. by aid of the the t-Boc technique. 
Advantageously, a PNA molecule is provided which comprises a base 
sequence which corresponds to the sequence of the inventive DNA 
molecule as well as partial sequences thereof. This PNA molecule 
will complex the mRNA or a site of the mRNA of GlcNAc-al , 2-f uco- 
syl transferase so that the translation of the enzyme will be in- 
hibited. Similar arguments as set forth for the antisense RNA 
apply in this case. Thus, e.g., a particularly efficient complex- 
ing region is the translation start region or also the 5 ' -non- 
translated regions of mRNA. 

A further aspect of the present invention relates to a 
method of preparing plants or insects, or cells, respectively, in 
particular plant or insect cells which comprise a blocked expres- 
sion of the GlcNAc-al , 3-f ucosyl transferase on transcription or 
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translation level, respectively, which is characterized in that 
inventive PNA molecules are inserted in the cells. To insert the 
PNA molecule or the PNA molecules, respectively, in the cell, 
again conventional methods, such as, e.g., electroporation or mi- 
croinjection, are used. Particularly efficient is insertion if 
the PNA oligomers are bound to cell penetration peptides, e.g. 
transportan or pAntp (Pooga et al . , 1998, Nature Biotechnology, 
16; 857-861) . 

The invention provides a method of preparing recombinant 
-glycoproteins, which _ is characterized in that the inventive, re- 
combinant plants or plant cells, respectively, as well as recom- 
binant insects or insect cells, respectively, whose GlcNAc-al , 3- 
fucosyl transferase production is suppressed or completely 
blocked, respectively, or plants or insects, or cells, respec- 
tively, in which the PNA molecules have been inserted according 
to the method of the invention, are transfected with the gene 
that expresses the glycoprotein so that the recombinant glycopro- 
teins are expressed. In doing so, as has already been described 
above, vectors comprising genes for the desired proteins are 
transfected into the host or host cells, respectively, as has 
also already been described above. The transfected plant or in- 
sect cells will express the desired proteins, and they have no or 
hardly any al, 3 -bound fucose. Thus, they do not trigger the im- 
mune reactions already mentioned above in the human or vertebrate 
body. Any proteins may be produced in these systems. 
Advantageously, a method of preparing recombinant human glycopro- 
teins is provided which is characterized in that the recombinant 
plants or plant cells, respectively, as well as recombinant in- 
sects or insect cells, respectively, whose GlcNAc-al , 3-f ucosyl 
transferase production is suppressed or completely blocked, or 
plants or insects, or cells, respectively, in which PNA molecules 
have been inserted according to the method of the invention, are 
transfected with the gene that expresses the glycoprotein so that 
the recombinant glycoproteins are expressed. By this method it 
becomes possible to produce human proteins in plants (plant 
cells) which, if taken up by the human body, do not trigger any 
immune reaction directed against ocl,3-bound fucose residues. 
There, it is possible to utilize plant types for producing the 
recombinant glycoproteins which serve as food stuffs, e.g. ba- 
nana, potato and/or tomato. The tissues of this plant comprise 
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the recombinant glycoprotein so that, e.g. by extraction of the 
recombinant glycoprotein from the tissue and subsequent admini- 
stration, or directly by eating the plant tissue, respectively, 
the recombinant glycoprotein is taken up in the human body. 
Preferably, a method of preparing recombinant human glycopro- 
teins for medical use is provided, wherein the inventive, recom- 
binant plants or plant cells, respectively, as well as 
recombinant insects or insect cells, respectively, whose GlcNAc- 
al,3-fucosyl transferase production is suppressed or completely 
blocked,, respectively, or plants or insects, or cells, respec- 
tively, into which the PNA molecules have been inserted according 
to the method of the invention, are transfected with the gene 
that expresses the glycoprotein so that the recombinant glycopro- 
teins are expressed. In doing so, any protein can be used which 
is of medical interest. 

Moreover, the present invention relates to recombinant gly- 
coproteins according to a method described above, wherein they 
have been prepared in plant or insect systems and wherein their 
peptide sequence comprises less than 50%, in particular less than 
20%, particularly preferred 0%, of the al,3-bound fucose residues 
occurring in proteins expressed in non-fucosyl transf erase-re- 
duced plant or insect systems. Naturally, glycoproteins which do 
not comprise al,3-bound fucose residues are to be preferred. The 
amount of al,3-bound fucose will depend on the degree of the 
above-described suppression of the GlcNAc-al , 3-f ucosyl transfe- 
rase. 

Preferably, the invention relates to recombinant human gly- 
coproteins which have been produced in plant or insect systems 
according to a method described above and whose peptide sequence 
comprises less than 50%, in particular less than 20%, particu- 
larly preferred 0%, of the otl,3-bound fucose residues occurring 
in the proteins expressed in non-fucosyl transf erase-reduced 
plant or insect systems. 

A particularly preferred embodiment relates to recombinant 
human glycoproteins for medical use which have been prepared in 
plant or insect systems according to a method described above and 
whose peptide sequence comprises less than 50%, in particular 
less than 2 0%, particularly preferred 0%, of the al,3-bound fu- 
cose residues occurring in the proteins expressed in non-fucosyl 
transf erase-reduced plant or insect systems. 
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The glycoproteins according to the invention may include 
other bound oligosaccharide units specific for plants or insects, 
respectively, whereby - in the case of human glycoproteins - they 
differ from these natural glycoproteins. Nevertheless, by the 
glycoproteins according to the invention, a slighter immune reac- 
tion or no immune reaction at all, respectively, is triggered in 
the human body, since, as has already been explained in the in- 
troductory portion of the specification, the al,3-bound fucose 
residues are the main cause for the immune reactions or cross im- 
mune reaction, respectively , to plant, and _insect glycoproteins. 

A further aspect comprises a pharmaceutical composition com- 
prising the glycoproteins according to the invention. In addition 
to the glycoproteins of the invention, the pharmaceutical compo- 
sition comprises further additions common for such compositions. 
These are, e.g., suitable diluting agents of various buffer con- 
tents (e.g. Tris-HCl, acetate, phosphate, pH and ionic strength, 
additives, such as tensides and solubilizers (e.g. Tween 80, 
Polysorbate 80), preservatives (e.g. Thimerosal, benzyl alcohol), 
adjuvants, antioxidants (e.g. ascorbic acid, sodium metabisul- 
f ite) , emulsifiers, fillers (e.g. lactose, mannitol), covalent 
bonds of polymers, such as polyethylene glycol, to the protein, 
incorporation of the material in particulate compositions of 
polymeric compounds, such as polylactic acid, polyglycolic acid, 
etc. or in liposomes, auxiliary agents and/or carrier substances 
which are suitable in the respective treatment. Such compositions 
will influence the physical condition, stability, rate of in vivo 
liberation and rate of in vivo excretion of the glycoproteins of 
the invention . 

The invention also provides a method of selecting DNA mole- 
cules which code for a GlcNAc-al , 3-f ucosyl transferase, in a sam- 
ple, wherein the labelled DNA molecules of the invention are 
admixed to the sample, which bind to the DNA molecules that code 
for a GlcNAc-al, 3-f ucosyl transferase. The hybridized DNA mole- 
cules can be detected, quantitated and selected. For the sample 
to contain single strand DNA with which the labelled DNA mole- 
cules can hybridize, the sample is denatured, e.g. by heating. 
One possible way is to separate the DNA to be assayed, possibly 
after the addition of endonucleases , by gele electrophoresis on 
an agarose gel. After having been transferred to a membrane of 
nitrocellulose, the labelled DNA molecules according to the in- 
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vention are admixed which hybridize to the corresponding homolo- 
gous DNA molecule ("Southern blotting"). 

Another possible way consists in finding homologous genes 
from other species by PCR-dependent methods using specific and/or 
degenerated primers, derived from the sequence of the DNA mole- 
cule according to the invention . 

Preferably, the sample for the above-identified inventive 
method comprises genomic DNA of a plant or insect organism. By 
this method, a large number of plants and insects is assayed in a 
very rapid and ef fici en t manner for the presence of the GlcNAc- 
al,3-fucosyl transferase gene. In this manner, it is respectively 
possible to select plants and insects which do not comprise this 
gene, or to suppress or completely block, respectively, the ex- 
pression of the GlcNAc-al , 3-f ucosyl transferase in such plants 
and insects which comprise this gene, by an above-described 
method of the invention, so that subsequently they may be used 
for the transfection and production of (human) glycoproteins. 
The invention also relates to DNA molecules which code for a 
GlcNAc-al , 3-fucosyl transferase which have been selected accord- 
ing to the two last-mentioned methods and subsequently have been 
isolated from the sample. These molecules can be used for further 
assays. They can be sequenced and in turn can be used as DNA 
probes for finding GlcNAc-al , 3-f ucosyl transferases. These - la- 
belled - DNA molecules will function for organisms, which are re- 
lated to the organisms from which they have been isolated, more 
efficiently as probes than the DNA molecules of the invention. 
A further aspect of the invention relates to a preparation of 
GlcNAc-al , 3-f ucosyl transferase cloned according to the invention 
which comprises isoforms having pi values of between 6.0 and 9.0, 
in particular between 6.8 and 8.2. The pi values of a protein is 
that pH value at which its net charge is zero and is dependent on 
the amino acid sequence, the glycosylation pattern as well as on 
the spatial structure of the protein. The GlcNAc-al , 3-f ucosyl 
transferase comprises at least 7 isoforms which have a pi value 
in this range. The reason for the various isoforms of the trans- 
ferase are, e.g., different glycosylations as well as limited 
proteolysis. Tests have shown that mung bean seedlings of vari- 
ous plants have different relationships of the isozymes. The pi 
value of a protein can be determined by isoelectric focussing, 
which is known to the skilled artisan. 
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The main isoform of the enzyme has an apparent molecular weight 
of 54 kDa. 

In particular, the preparation of the invention comprises 
isoforms having pi values of 6.8, 7.1 and 7.6. 

The invention also relates to a method of preparing "planti- 
fied M carbohydrate units of human and other vertebrate glycopro- 
teins, wherein fucose units as well as GlcNAc-al , 3-f ucosyl 
transferase encoded by an above-described DNA molecule are ad- 
mixed to a sample that comprises a carbohydrate unit or a glyco- 
protein, respectively, so that fucose in al , 3-position will be 
bound by the GlcNAc-al , 3-f ucosyl transferase to the carbohydrate 
unit or to the glycoprotein, respectively. By the method accord- 
ing to the invention for cloning GlcNAc-al , 3-f ucosyl transferase 
it is possible to produce large amounts of purified enzyme. To 
obtain a fully active transferase, suitable reaction conditions 
are provided. It has been shown that the transferase has a par- 
ticularly high activity at a pH of approximately 7, if 2- (N-mor- 
pholino) -ethane sulfonic acid-HCl is used as the buffer. In the 
presence of bivalent cations, in particular Mn 2+ , the activity of 
the recombinant transferase is enhanced. The carbohydrate unit is 
admixed to the sample either in unbound form or bound to a pro- 
tein. The recombinant transferase is active for both forms. 
The invention will be explained in more detail by way of the fol- 
lowing examples and drawing figures to which, of course, it shall 
not be restricted. 

In detail, in the drawings, 
Figs, la and lb show, as curves, the measured amounts of protein 
and the measured enzyme activity in the individual fractions of 
the eluate; 

Fig. 2 shows an electrophoresis gel analysis of GlcNAc-al , 3-f uco- 
syl transferase; 

Fig. 3 shows the result of the isoelectric focussing and the 
measured transferase activity of the individual isoforms; 
Fig. 4 shows the N-terminal sequences of 4 tryptic peptides 1-4 
as well as the DNA sequence of three primers, SI, A2 and A3; 
Figs. 5a and 5b show the cDNA sequence of al,3-fucosyl transfe- 
rase; 

Figs. 6a and 6b show the amino acid sequence of al,3-fucosyl 
transferase derived therefrom; 

Fig. 7. is a schematic representation of the al, 3-f ucosyl transfe- 
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rase as well as the hydrophobic city of the amino acid residues; 
Fig. 8 shows a comparison of the conserved motifs of various fu- 
cosyl transferases; 

Fig. 9 shows a comparison of the fucosyl transferase activity of 
insect cells transfected with the ccl, 3-fucosyl transferase gene 
with that of a negative control; 

Figs. 10a and 10b show structures of different acceptors of the 

al , 3 -fucosyl transferase; 

Figs. 11 and 12 show mass spectra; and 

Fig. _13 shows the result of a HPLC. 

Example 1: 

Isolation of the core-gel , 3-fucosvl transferase 
All the steps were, carried out at 4°C. Mung bean seedlings 
were homogenized in a mixer, 0.75 volumes of extraction buffer 
being used per kg of beans. Subsequently, the homogenate was fil- 
tered through two layers of cotton fabric, and the filtrate was 
centrifuged for 40 min at 3 0000xg. The supernatant was discarded, 
and the pellet was extracted with solution buffer over night with 
continuous stirring. Subsequent centrif ugation at 3 0000xg for 40 
min yielded the triton. extract. 

The triton extract was purified as follows: 
Step 1: The triton extract was applied to a microgranular 
diethyl amino ethyl cellulose anion exchanger DE52 cellulose col- 
umn (5x2 8 cm) from Whatman, which previously had been calibrated 
with buffer A. The non-bound fraction was further treated in step 
2. 

Step 2: The sample was applied to an Affi-Gel Blue column 
(2,5x32) column calibrated with buffer A. After washing of the 
column whith this buffer, adsorbed protein was eluted with buffer 
A comprising 0.5 M NaCl . 

Step 3: After dialysis of the eluate from step 2 against 
buffer B, it was applied to an S-Sepharose column calibrated with 
the same buffer. Bound protein was eluted with a linear gradient 
of from 0 to 0 . 5 M NaCl in buffer B. Fractions with GlcNAc-al,3- 
fucosyl transferase were pooled and dialyzed against buffer C. 

Step 4: The dialyzed sample was applied to a GnGn-Sepharose 
column calibrated with buffer C. The bound protein was eluted 
with buffer C comprising 1 M NaCl instead of MnCl 2 . 

Step 5: Subsequently, the enzyme was dialyzed against buffer 
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D and applied to a GDP-Hexanolamine-Sepharose column. After hav- 
ing washed the column with buffer D, the transferase was eluted 
by substituting MgCl 2 and NaCl with 0.5 mM GDP. Active fractions 
were pooled, dialyzed against 20 mM Tris-HCl buffer, pH 7.3, and 
lyophilized. 

The enzymatic activity of the GlcNAc-al , 3-f ucosyl transfe- 
rase was determined by using GnGn peptide and GDP-L- [U- 14 C] -fucose 
at substrate concentrations of 0.5 and 0.25 each, in the presence 
of 2- (N-morpholino) ethanesulf onic acid-HCl buffer, Triton X-100, 
MnCl 2 , GlcNAc and AMP (according to Staudacher et al . , 1998, Gly- 
coconjugate J. 15, 355-360; Staudacher et al . , 1991, Eur. J. Bio- 
chem. 199, 745-751) . 

Protein concentrations were determined by aid of the bicin- 
choninic acid method (Pierce) or, in the final steps of enzyme 
purification, by means of amino acid analysis (Altmann 1992, 
Anal. Biochem. 204, 215-219). 

In Figs, la and lb, the measured amounts of protein and the 
measured enzyme activity in the individual fractions of the elu- 
ate are illustrated as curves. Fig. la shows the above-described 
separation on the S-Sepharose column, Fig. lb shows the separa- 
tion on the GnGn-Sepharose column, the circle representing pro- 
tein, the black, full circle representing GlcNAc-al , 3-f ucosyl 
transferase, and the square illustrating N-acetyl-p-glucosamini- 
dase. One U is defined as that amount of enzyme which transfers 1 
mmol of fucose onto an acceptor per minute. 

Table 1 shows the individual steps of transferase purifica- 
tion . 
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Table 1 

Purification Total Total Specific Purification Yield 

step protein activity activity factor 





mg 




mU 


mU/mg 


-fold 


% 




Triton X-100 
















extract 


91500 




4846 


0. 05 


1 


100 




DE52 


43700 




4750 


0. 10 


2 


98 


. 0 


Affigel Blue 


180 


. 5 


4134 


23 


460 


85 


.3 


.S-Sepharose 


8 


.4 


3251 


390 


7800 


67 


. 1 


GnGn-Sepharose 


0 


.13 1 


1044 


8030 


160000 


21 


. 5 


GDP-Hexanolamine 
















Sepharose 


0 


. 02 1 


867 


43350 


867000 


17 . 


9 



determined by amino acid analysis 



Extraction buffer: 

0.5 mM Dithiothreitol 
1 mM EDTA 

0.5% Polyvinyl polypyrrolidone 
0.25 M Sucrose 

50 mM Tris-HCl buffer, pH 7 . 3 
Solution buffer: 

0.5 mM Dithiothreitol 

1 mM EDTA 

1 . 5% Triton X-100 
50 mM Tris-HCl, pH 7 . 3 
Buffer A: 

2 5 mM Tris-HCl buffer, pH 7.3, comprising: 
0.1% Triton X-100 and 

0.02% NaN 3 
Buffer B: 

25 mM Na citrate buffer, pH 5.3, comprising: 
0.1% Triton X-100 and 
0 . 02% NaN 3 
Buffer C: 

25 mM Tris-HCl buffer, pH 7.3, comprising: 
5 mM MnCl 2 nd 
0.02% NaN 3 
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Buffer D: 

25 mM Tris-HCl, pH 7.3, comprising: 

10 mM MgCl 2 

0 . 1 M NaCl, and 

0.02% NaN 3 

Example 2: 

SDS-PAGE and isoelectric focussing 
An SDS-PAGE was carried out in a Biorad Mini-protean cell on gels 
with. 12.5% acrylamide and 1% bisacrylamide . The gels were stained 
either with Coomassie Brilliant Blue R-250 or Silver. Isoelectric 
focussing of the fucosyl transferase was carried out on prefabri- 
cated gels having a pi range of between 6-9 (Servalyt precotes 6- 
9, Serva) . The gels were stained with silver according to the 
producer's protocol. For the two-dimensional electrophoresis, 
lanes were cut out of the focussing gel, treated with S-alky- 
lating reagents and SDS and subjected to an SDS-PAGE, as de- 
scribed above. 

Fig. 2 shows the illustration of an electrophoresis gel of 
GlcNAc-al , 3 -fucosyl transferase, the two-dimensional electropho- 
resis being indicated on the left-hand side, and the one-dimen- 
sional SDS-PAGE being illustrated on the right-hand side. The 
lane denoted by A is a standard, the lane denoted by B is the 
GlcNAc-al, 3-fucosyl transferase from the GnGn-Sepharose column, 
and the lane denoted by C is the "purified" GlcNAc-al , 3-fucosyl 
transferase, i.e. the fraction of the GDP Hexanolamine Sepharose 
column. The two bands at 54 and 5 6 kDa represent isoforms of the 
transferase . 

Fig. 3 shows the result of the isoelectric focussing. Lane A was 
stained with silver, on lane B, the activity of the transferase 
isoforms was tested. The activity is indicated as % fucose which 
had been transferred from GDP-fucose onto the substrate. 
Example 3 : 

Peptide sequencing 
For sequencing of the protein, bands were cut out of the Coomas- 
sie-stained SDS-Polyacrylamide gel, carboxyamido-methylated and 
cleaved with trypsin according to Gorg et al . 1988, Electrophore- 
sis, 9, 681-692. The tryptic peptides were separated with the 
reverse phase HPLC on a 1.0x250 mm Vydac C18 at 40°C at a flow 
rate of 0.05 ml/min, wherein a HP 1100 apparatus (Hewlett- 
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Packard) was used. The isolated peptides were separated with a 
Hewlett-Packard G1005 A Protein Sequencing System according to 
the producer's protocol. Furthermore, the peptide mixture was 
analyzed by Ingel digestion with MALDI-TOF MS (see below) . 
Fig. 4 shows the N- terminal sequences of 4 tryptic peptides 1-4 
(SEQ ID NO: 5-8) . Departing from the first three peptides, prim- 
ers SI, A2 and A3 were prepared (SEQ ID NO: 9-11) . 
Example 4 : 

RT-PCR and cDNA cloning 
The entire^ RNA was isolated from a 3-day-old mung bean hypocotyl, 
wherein the SV Total RNA Isolating System of Promega was used. To 
prepare the first strand cDNA, the entire RNA was incubated for 1 
h at 48°C with AMV reverse transcriptase and oligo(dT) primers, 
wherein the Reverse Transcription System of Promega was used. 
The first strand cDNA was subjected to a PCR, wherein, a combina- 
tion of sense and antisense primers was used: 

To 10 \il of the reverse transcription reaction mixture, the fol- 
lowing was added: 

50 ul with 0.1 mmol of each primer, 0.1 mM dNTPs , 2 mM MgCl 2 , 10 
mM Tris-HCl buffer, pH 9,0, 50 mM KC1 and 0.1% Triton X-100. 
After a first denaturing step at 95°C for 2 min, 40 cycles of 1 
min at 95°C, 1 min at 49°C and 2 min at 72°C were passed. The 
last extension step was carried out at 72 °C for 8 min. PCR prod- 
ucts were subcloned into the pCR2 . 1 vector, with the TA Cloning 
Kit of Invitrogen being used, and sequenced. The products of this 
PCR were two DNA fragments with lengths of 744 bp and 780 bp, 
both DNA fragments having the same 5 '-end (cf. also Fig. 7). 
Starting from these two DNA fragments, the missing 5' and 3' re- 
gions of the cDNA were obtained by 5' and 3' rapid amplification 
of cDNA ends (RACE) , wherein the RACE Kit of Gibco-BRL was used. 
As the antisense primer, the universal amplification primer of 
the kit, and as the sense primer, either 5 ' -CTGGAACTGTCCCTGTGGTT- 
3' (SEQ ID NO: 12) or 5'- AGTGCACTAGAGGGCCAGAA-3 ' (SEQ ID NO: 13) 
were used. As the sense primer, also the shortened anchor primer 
of the kit, and as the antisense primer, 5 ' -TTCGAGCACCA- 
CAATTGGAAAT-3 ' (SEQ ID NO: 14) or 5 ' -GAATGCAAAGACGGCACGATGAAT-3 ' 
(SEQ ID NO: 15) were used. 

The PCR was carried out with an annealing temperature of 55°C and 
under the above-described conditions. The 5' and 3' RACE products 
were subcloned into the pCR2 . 1 vector and sequenced: The se- 
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guences of the subcloned fragments were sequenced by means of the 
didesoxynucleotide method (ABI PRISM Dye Terminator Cycle Se- 
quencing Ready reaction Kit and ABI PRISM 310 Genetic analyser 
(Perkin Elmer) ) . T7 and M13 forward primers were used for the se- 
quencing of the products cloned into vector pCR2 . 1 . Both strands 
of the coding region were sequenced by the Vienna VBC Genomics- 
Sequencing Service, infrared-labelled primers (IRD700 and IRD800) 
and an LI-COR Long Read IR 4200 Sequencer (Lincoln, NE) being 
used. 

Figs. 5a and 5b show the entire cDNA which has a size of 2198 bp 
and an open reading frame of 153 0 bp (SEQ ID NO: 1) . The open 
reading frame (start codon at base pairs 211-213, stop codon at 
base pairs 1740-1743) codes for a protein of 510 amino acids hav- 
ing a molecular weight of 56.8 kDA and a theoretical pi value of 
7.51. 

Figs . 6a and 6b show the cDNA-derived amino acid sequence of the 
GlcNAc-al, 3-fucosyl transferase (SEQ ID NO: 2). Sites for the as- 
paragine-bound glycosylation are at Asn346 and Asn429. 
In Fig. 7, the schematic GlcNAc-al , 3-fucosyl trans ferase-cDNA 
(top) and the derived hydrophobicity index of the encoded protein 
(bottom) are illustrated, a positive hydrophobicity index meaning 
an increased hydrophobicity. Therebetween, the sizes of the two 
above-indicated PCR products are shown in relationship to the 
complete cDNA . The coding region is -illustrated by the beam, "C" 
coding for the postulated cytoplasmatic region, T for the postu- 
lated transmembrane region, and G for the postulated Golgi lumen 
catalytic region of transferase. The analysis of the DNA sequence 
by "TMpred" (from EMBnet, Switzerland) gave an assumed transmem- 
brane region between Asn36 and Gly54. The C-terminal region of 
the enzyme probably comprises the catalytic region and conse- 
quently should point into the lumen of the Golgi apparatus. Ac- 
cording to this, this transferase seems to be a type II 
transmembrane protein like all the hitherto analyzed glycosyl 
transferases which are involved in glycoprotein biosynthesis 
(Joziasse, 1992, Glycobiology 2, 271-277) . The gray regions rep- 
resent the four tryptic peptides, the hexagons represent the po- 
tential N-glycosylation sites. A'BLASTP search in all data banks 
accesible via NCBI showed a similarity between the GlcNAc-ocI , 3- 
fucosyl transferase and other ocl , 3 /4-f ucosyl transferases, e.g. 
human fucosyl transferase VI. At 18-21% (examined by SIM-LALN- 
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VIEW, Expase, Switzerland) , the total similarity was beyond any 
significance. Nevertheless, a sequence range of 35 amino acids 
(SEQ ID NO: 4) shows a strikingly high homology to other otl,3/4- 
fucosyl transferases (Fig. 8) . This sequence region is located 
between Glu267 and Pro301 of SEQ ID NO: 2. 
Example 5 : 

Expression of recombinant GlcNAc-al . 3 -fucosyl transferase in 
insect cells 

The encoding region of the assumed GlcNAc-'al , 3-f ucosyl transfe- 
rase including cytoplasmatic and transmembrane region was ampli- 
fied with the forward primer 5 ' -CGGCGGATCCGCAATTGAATGATG-3 ' (SEQ 
ID NO: 16) and reverse primer 5 ' -CCGGCTGCAGTACCATTTAGCGCAT-3 ' 
(SEQ ID NO: 17) by means of the Expand High Fidelity PCR System 
of Boehringer Mannheim. The PCR product was double-digested with 
PstI and BamHI and subcloned in alkaline phosphatase-treated ba- 
culovirus transfer vector pVL1393 which previously had been di- 
gested with PstI and BamHI. To ensure a homologous recombination, 
the transfer vector was co-transf ected with Baculo Gold viral DNA 
(PharMingen, Sand Diego, CA) in Sf9 insect cells in I PL- 41 Medium 
with lipofectin. After an incubation of 5 days at 27°C, various 
volumes of the supernatant with the recombinant virus were used 
for infecting the Sf21 insect cells. After an incubation of 4 
days at 27°C in IPL-41 Medium with 5% FCS, the Sfl cells were 
harvested and washed 2x with phosphate-buffered saline solution. 
The cells were resuspended in 25 mM Tris HC1 buffer, pH 7.4, with 
2% Triton X-100 and broken up by sonication on ice. 
Example 6 : 

Assay for GlcNAc-al . 3-f ucosyl transferase activity 
The homogenate and. the cell supernatant were assayed for GlcNAc- 
al, 3-f ucosyl transferase. Blind samples were carried out with re- 
combinant baculovirus which codes for the tobacco-GlcNAc-transf e- 
rase I (Strasser et al . , 1999, Glycobiology , in the process of 
printing) . 

Fig. 9 shows the measured enzyme activity of the recombinant 
GlcNAc-al , 3 -fucosyl transferase as well as of the negative con- 
trol. At best, the enzyme activity of the cotransf ected cells and 
their supernatant was 3 Ox higher than that of the negative con- 
trols. This endogenous activity which is measurable in the ab- 
sence of the recombinant transferase, substantially comes from 
the insect-al , 6-f ucosyl transferase and only a low percentage 
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thereof comes from the GlcMAc-al , 3-f ucosyl transferase. Accord- 
ingly, the increase in the GlcNAc-ccl , 3-f ucosyl transferase coming 
from the recombinant baculoviruses is far more than the 100-fold. 
The enzyme exhibited a broad maximum activity around a pH of 7.0, 
if the activity was measured in 2- (N-morpholino ) -ethanesulf onic 
acid-HCl buffer. As is apparent in Table 2, the addition of biva- 
lent cations, in particular Mn 2+ , enhances the activity of the re- 
combinant transferase . 

Table 2 

Additive Relative Activity 

(cone. 10 mM). (Acceptor: GnGn-peptide) 



none 


21 


EDTA 


18 


MnCl 2 


100 


CaCl 2 


82 


MgCl 2 


52 


CdCl 2 


44 


CoCl 2 


35 


CuCl 2 


3 


NiCl 2 


24 


ZnCl 2 


0. 



Table 3 shows that among the acceptors used, the GnGn-pep- 
tide exhibits the highest incorporation rates under standard 
test conditions, followed closely by GnGnF 6 eptide and M5Gn-Asn. A 
transfer to the MM peptide could not be found, which MM peptide 
does not comprise the reducing GlcNAc-end at the 3-bound mannose. 
This structure seems to be necessary for the core fucosyl trans- 
ferase. The recombinant transferase, moreover, was inactive rela- 
tive to the acceptors commonly used, the a, 3 /4-f ucosyl 
transferases used for determining the blood groups, which trans- 
fer the fucose to GlcNAc at the non-reducing ends of oligosaccha- 
rides. The apparent K m -values for the acceptor substrate GnGn 
peptide, GnGnF 6 peptide , M5Gn-Asn, and for the donor substrate 
GDP-fucose, were assessed to be 0.19, 0.13, 0.23 and 0.11, re- 
spectively. The structures of the molecules are illustrated in 
Figs. 10a and 10b. 
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Table 3 



Acceptor Substrate 


Rel . Activity 


I^-Value 




% 


mM 


GnGn-peptide 


100 


0.19 


GnGnF 6 -peptide 


87 


0 . 13 


M5Gn-Asn 


71 


0.23 


MM-peptide 


0 




Gal(3-4GlcNAc 


0 




Gal(3l-3GlcNAc 


0 




Gal(3l-3GlcNAcpl-3Galpl-4Glc 


0 





Example 7 : 

Mass spectrometry of the fucosvl transferase product 
Dabsylated GnGn hexapeptide (2 nmol) was incubated with the 
insect cell homogenate comprising the recombinant GlcNAc-a, 3-f u- 
cosyl transferase (0.08 mU) in the presence of non-radioactive 
GDP-L-fucose (10 nmol), 2 (N-morpholino) -ethanesulf onic acid-HCl 
buffer, Triton X-100, MnCl 2 , GlcNAc and AMP. A negative control 
was carried out with a homogenate of the infected insect cells 
for the blind samples. The samples were incubated for 16 h at 
3 7°C and analyzed by means of MALDI TOF mass spectrometry. 
Mass spectrometry was performed on. a DYNAMO (Therrmo BioAnaly- 
sis, Santa Fe, NM) , a MALDI -TOF MS which is capable of dynamic 
extraction (synonym for late extraction) . Two types of sample ma- 
trix preparations were used: peptides and dabsylated glycopep- 
tides were dissolved in 5% formic acid, and aliquots were applied 
to the target, air-dried, and covered with 1% oc-cyano-4-hydroxy 
cinnamic acid. Pyridyl-aminated glycans , reduced oligosaccharides 
and non-derivatized glycopeptides were diluted with water, ap- 
plied to the target and air-dried. After addition of 2% 2.5-dihy- 
droxy benzoic acid, the samples were immediately dried by 
applying a vacuum. 

Fig. 11 shows the mass spectrum of these samples, A being 
the negative control: The main peak (S) shows the Dabsyl-Val-Gly- 
Glu- (GlcNAc 4 Man 3 ) Asn-Arg-Thr substrate, the calculated [M+H]- 
value being 2262.3. This substrate also appears as sodium addi- 
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tion product and as smaller ion which has been formed by fragmen- 
tation of the Azo function of the Dabsyl group, at (S*) . A small 
product amount (P, [M+H] + = 2408.4) is a consequence of- the en- ^ 
dogenous al,6-fucosyl transferase. The peak at m/z = 2424.0 
shows the incomplete de-galactosylation of the substrate. The 
mass spectrum B shows the sample with recombinant al,3-fucosyl 
transferase. The main peak (P) represents the fucosylated prod- 
uct, (P*) its fragmented ion. 

In addition, aliquots of both samples were mixed with each 
other so as_ to obtain similar concentrations of substrate and 
product (sample A). This mixture was diluted with 0.1 M ammonium 
acetate, pH 4.0, comprising 10 mU of N-glycosidase A (sample B) , 
or with 50 mM Tris/HCl, pH 8.5, comprising 100 mU (1 U hydrolyses 

1 mmol of substrate per min) of N-glycosidase F (sample C) . After 

2 and 20 h, small aliquots of these mixtures were taken and ana- 
lyzed by means of MALDI-TOF MS. 

In Fig. 12, the three mass spectra of samples A, B and Care 
illustrated. The undigested sample A shows two main peaks: the 
substrate at 22 61.4 m/z, and the fucosylated product at 2407.7 
m/z. The middle curve shows the mass spectrum of sample B, 
treated with N-glycosidase A, which hydrolyses both glycopep- 
tides. The peak at 963.32 constitutes the deglycosylated product. 
The lower curve shows the mass spectrum of sample C. The N-glyco- 
sidase F is not able to hydrolyse a 1 , 3-f ucosylated substrates, 
so that the spectrum has the peak at 2406.7 m/z of the fucosy- 
lated product, whereas the peak of the hydrolysed substrate ap- 
pears at 963.08 m/z. 

Example 8 : 

HPLC-analysis of the pyridyl-aminated fucosvl transferase 
product ■ 

The two above-described samples (fucosylated product and negative 
control) were digested with N-glycosidase A. The oligosaccharides 
obtained were pyridyl-aminated and analysed by means of reverse 
phase HPLC (Wilson et al . , 1998, glycobiology 8, 651-661; Kubelka 
et al., 1994, Arch. Biochem. Giophys . 308, 148-157; Hase et al . , 
1984, J. Biochem. 95, 197-203). 

In Fig. 13, the top diagram B represents the negative control, 
wherein in addition to the residual substrate (GnGn-peptide) a 
1 , 6-f ucosylated product is visible. A has a peak at a substan- 
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tially shorter retention time, which is specific of reducing fu- 
cose bound to GlcNAc-cx 1,3. 

In the bottom diagram, the isolated transferase product prior to 
(curve A) and following (curve B) digestion by N-acetyl-pgluco- 
saminidase was compared with MMF 3 honeybee phospholipase A 2 (curve 
C) . 
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SEQUENCE LISTING 

<l\0> Altmann Dr., Friedrich 

<12o\ alpha 1 , 3-f ucosyltransf erase R35063 

<130> Dycosyl transferase gene 

<140> 
<141> 

<160> 17 

<170> Patentln Ve\. 2.1 

<210> 1 
<211> 2198 
<212> DNA 
<213> plant 



<400> 1 

actaactcaa 

aaaaaacaac 

tcatggaatc 

tgcggaattg 

tcgagaacag 

aagaggaaat 

gcgtttctgg 

ttcttctacc 

tctgatcgga 

acgtattcga 

aagtcgtgtt 

tttgggttac 

tactatgctg 

accagtctat 

gcaccagtgc 

gctcgaaatt 

tcttatggtg 

cactacaaat 

ttcttccaat 

gactttgctc 

gttgcaaaga 

tggaagtatg 

tcatcgtgcc 

ccaagcctta 

tatgtcagag 

ac tctgaatg 

gtatggaaga 



acgctgcatt 
agcaagctgt 
gtgctcataa 
gcagttgggg 
atggtgccca 
ggagcaatct 
gtaggttgga 
gctctcgagc 
attctgaatc 
ggggcttttc 

cggttggatg 
ctcaaccaag 
agaacaatat 
cttcggatgt 
agccgaaaac 
tccggttgca 
gttgtcacag 
ttagcttagc 
cccttgttgc 
cttctcctgg 
ccatgagata 
agggtccatc 
gtctttgcat 
agagacgtcc 
aaaggggaag 
c tgtgaaggc 
ctgaaaggcc 



ttctttkttc 
gttttttt 
cgccaaaatl 
gcgcaattga 
acaagacagc 
aatgcctctt 
tatggccaaa 
ggtcgttgaa 
gtatagttgt 
caaagagcct 
taaatttggg 
tggaacagct 
tgccatggca 
tcctgttgga 
tgaagctgct 
agctcttgag 
gaaccgtgat 
gtttgaaaat 
tggaactgtc 
ttcaatttta 
tctagcagaa 
tgactccttc 
tcacttggcc 
ttgcaagtgc 
gtttgagatg 
tgctgttgtt 
tgaagttata 



tttcagggaa 
tcgttctttt 
ttccatttcc 
tgatgggtc 
t\acccgttt 
gtogttgccc 
aacgfccgcca 
ggtgaogatt 
gaggaat?§gt 
attttt< 
tttagtgggc 
agcattctgc 
agacggaggg 
tatttttcat 
cttgcagctg 
gcccttgaaa 
ggaagagtga 
tcgaatgagg 
cctgtggttg 
catattaaag 
aatcccgaag 
aaggccc ttg 
acagtgagta 
actagagggc 
gagtccattt 
ttgaagttca 
agagggggga 



ccatccaccc 
tctttaaaca 
ctttgatttt 
tgttgacgaa 
tggctccggg 
ttgtggtcat 
tggttgactc 
tggggttggg 
tggagaggga 
ctggagctga 
tagaaagcc 
tcaatgga 
ga\ataacat 
gggO^gagta 
ctttcatttc 
aatcaaacat 
acaaagtgg^a 
aagattatgcS 
ttggtgctcc 
agatagagga 
catataatca 
tggatatggc 
gagagaagga 
cagaaaccgt 
acctgaggtc 
catccctgaa 
gtgct ttaaa 



ataacaacaa 
agcaccccca 
tagtttattt 
tcttcgaggc 
aggcaaccca 
cgcggagatc 
cctcgctgac 
tttggtggct 
ggatgctgtc 
tcaggagtgg 
agatgccgca 
atcagcagaa 
cgtaatgaca 
tgatatgatg 
caattgtggt 
caaaattgat 
agccctgaag 
aactgaaaaa 
atattcag 
tgVtgagtct 
atcaVtgagg 
agctgts^cat 
at 

atatcataDyC 
tagcaattta 
tcttgtgcct 
actctacaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
,1500 
i60 
1^20 
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atatacccaa ttggcttgac acagagacaa gctctttata ccttcagctt caaaggtgat 1680 

gctgatttca ggagtcactt ggagaacaat ccttgtgcca agtttgaagt catttttgtg 1740 

tagcatgcgc taaatggtac ctctgctcta cctgaattag cttcacttag ctgagcacta 1800 

gctagagttt taggaatgag tatggcagtg aatatggcat ggctttattt atgcctagtt 1860 

tcttggccaa ctcattgatg ttttgtataa gacatcacac tttaatttta aacttgtttc 1920 

tgtagaagtg caaatccata tttaatgctt agttttagtg ctcttatctg atcatctaga 1980 

agtcacagtt cttgtatatt gtgagtgaaa actgaaatct aatagaagga tcagatgttt 2040 

cactcaagac acattattac ttcatgttgt tttgatgatc tcgagctttt ttagtgtctg 2100 

gaactgtccc tgtggtttga gcacctgtta ttgcttcagt gttactgtcc agtggttatc 2160 

gtttttgacc tctaaaaaaa aaaaaaaaaa aaaaaaaa 2198 

<210> 2 

<211> 510" " - - - 

<212> PRT 
<213> plant 

<400> 2 

Met Met Gly Leu Leu Thr Asn Leu Arg Gly Ser Arg Thr Asp Gly Ala 
15 10 15 

Gin Gin Asp Ser Leu Pro Val Leu Ala Pro Gly Gly Asn Pro Lys Arg 
20 25 30 

Lys Trp Ser Asn Leu Met Pro Leu Val Val Ala Leu Val Val lie Ala 
35 40 45 

Glu lie Ala Phe Leu Gly Arg Leu Asp Met Ala Lys Asn Ala Ala Met 
50 55 60 

Val Asp Ser Leu Ala Asp Phe Phe Tyr Arg Ser Arg Ala Val Val Glu 
65 70 75 80 

Gly Asp Asp Leu Gly Leu Gly Leu Val Ala Ser Asp Arg Asn Ser Glu 
85 90 95 

Ser Tyr Ser Cys Glu Glu Trp Leu Glu Arg Glu Asp Ala Val Thr Tyr 
100 105 110 

Ser Arg Gly Phe Ser Lys Glu Pro lie Phe Val Ser Gly Ala Asp Gin 
115 120 125 

Glu Trp Lys Ser Cys Ser Val Gly Cys Lys Phe Gly Phe Ser Gly Asp 
130 135 140 

Arg Lys Pro Asp Ala Ala Phe Gly Leu Pro Gin Pro Ser Gly Thr Ala 
145 150 155 160 



Ser lie Leu Arg Ser Met Glu Ser Ala Glu Tyr Tyr Ala Glu Asn Asn 



lie Ala Met Ala 
180 

Leu Ser Ser Asp 
195 

Met Met Ala Pro 
210 

Phe lie Ser Asn 

225 - " 

Ala Leu Glu Lys 



Arg Asn Arg Asp 
260 

Lys Phe Ser Leu 
275 

Glu Lys Phe Phe 
290 

Gly Ala Pro Asn 
305 

His lie Lys Glu 



Tyr Leu Ala Glu 
340 

Tyr Glu Gly Pro 
355 

Val His Ser Ser 
370 

Glu Lys Glu Glu 
385 

Thr Arg Gly Pro 



Arg Phe Glu Met 



165 

Arg Arg Arg Gly 



Val Pro Val Gly 
200 

Val Gin Pro Lys 
215 

Cys Gly Ala Arg 
23 0 

Ser Asn lie Lys 
245 

Gly Arg Val Asn 



Ala Phe Glu Asn 
280 

Gin Ser Leu Val 
295 

lie Gin Asp Phe 
310 

lie Glu Asp Val 

325 

Asn Pro Glu Ala 



Ser Asp Ser Phe 
360 

Cys Arg Leu Cys 
375 

Asn Asn Pro Ser 
390 

Glu Thr Val Tyr 
405 

Glu Ser lie Tyr 
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170 

Tyr Asn lie Val 
185 

Tyr Phe Ser Trp 



Thr Glu Ala Ala 
220 

Asn Phe Arg Leu 

- 235 - 

lie Asp Ser Tyr 
250 

Lys Val Glu Ala 
265 

Ser Asn Glu Glu 



Ala Gly Thr Val 
300 

Ala Pro Ser Pro 
315 

Glu Ser Val Ala 
330 

Tyr Asn Gin Ser 
345 

Lys Ala Leu Val 



lie His Leu Ala 
380 

Leu Lys Arg Arg 
395 

His lie Tyr Val 
410 

Leu Arg Ser Ser 



Met Thr Thr Ser 
190 

Ala Glu Tyr Asp 
205 

Leu Ala Ala Ala 



Gin Ala Leu Glu 

■- - -240 

Gly Gly Cys His 
255 

Leu Lys His Tyr 
270 

Asp Tyr Val Thr 
285 

Pro Val Val Val 



Gly Ser lie Leu 
320 

Lys Thr Met Arg 
335 

Leu Arg Trp Lys 
350 

Asp Met Ala Ala 
365 

Thr Val Ser Arg 



Pro Cys Lys Cys 
400 

Arg Glu Arg Gly 
415 

Asn Leu Thr Leu 



420 

Asn Ala Val Lys 
435 

Val Pro Val Trp 
450 

Ala Leu Lys Leu 
465 

Ala Leu Tyr Thr 



Leu Glu Asn Asn 
500 



Ala Ala Val Val 
440 

Lys Thr Glu Arg 
455 

Tyr Lys lie Tyr 
470 

Phe Ser Phe Lys 

485 - - - - 

Pro Cys Ala Lys 



35 - 

425 

Leu Lys Phe Thr 



Pro Glu Val lie 
460 

Pro lie Gly Leu 
475 

Gly Asp Ala Asp 
490 

Phe Glu Val lie 
505 



430 

Ser Leu Asn Leu 
445 

Arg Gly Gly Ser 



Thr Gin Arg Gin 
480 

Phe Arg Ser His 

495 - 

Phe Val 
510 



<210> 3 
<211> 105 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cDNA 
<400> 3 

gaagccctga agcactacaa atttagctta gcgtttgaaa attcgaatga ggaagattat 60 
gtaactgaaa aattcttcca atcccttgtt gctggaactg tccct 105 

<210> 4 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : peptide 
<400> 4 

Glu Ala Leu Lys His Tyr Lys Phe Ser Leu Ala Phe Glu Asn Ser Asn 
15 10 15 

Glu Glu Asp Tyr Val Thr Glu Lys Phe Phe Gin Ser Leu Val Ala Gly 
20 25 30 



Thr Val Pro 
35 



- 36 - 



<210> 5 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : peptide 
<400> 5 

Lys Pro Asp Ala Xaa Phe Gly Leu Pro Gin Pro Ser Thr Ala Ser 
15 10 15 

<210> 6 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : peptide 
<400> 6 

Pro Glu Thr Val Tyr His lie Tyr Val Arg 
15 10 



<210> 7 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : peptide 

<400> 7 . 
Met Glu Ser Ala Glu Tyr Tyr Ala Glu Asn Asn lie Ala 
15 10 



<210> 8 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : peptide 



<400> 8 

Gly Arg Phe Glu Met Glu Ser lie Tyr Leu 
15 10 



<210> 9 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA 

<400> 9 

gcngartayt aygcngaraa yaayathgc 

<210> 10 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :DNA 
<400> 10 

crtadatrtg rtanacngty tc 

<210> 11 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA 
<400> 11 

tadatnswyt ccatytcraa 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA 



<400> 12 

ctggaactgt ccctgtggtt 
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<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :DNA 
<400> 13 

agtgcactag agggccagaa 

<210> 14 ■ - - - - - - - - 

<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :DNA 
<400> 14 

ttcgagcacc acaattggaa at 

<210> 15 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA 
<400> 15 

gaatgcaaag acggcacgat gaat 

<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:DNA 
<400> 16 

cggcggatcc gcaattgaat gatg 

<210> 17 
<211> 25 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DNA 
<400> 17 

ccggctgcag taccatttag cgcat 



